fix(mothership): tool call loop by Sg312 · Pull Request #3729 · simstudioai/sim

Sg312 · 2026-03-24T00:44:50Z

Summary

Rewrite mothership tool call loop

Type of Change

Bug fix

Testing

Manual

Checklist

Code follows project style guidelines
Self-reviewed my changes
Tests added/updated and passing
No new warnings introduced
I confirm that I have read and agree to the terms outlined in the Contributor License Agreement (CLA)

…keyboard shortcuts, audit logs

…rects to rewrites

…stash, algolia tools; isolated-vm robustness improvements, tables backend (#3271) * feat(tools): advanced fields for youtube, vercel; added cloudflare and dataverse tools (#3257) * refactor(vercel): mark optional fields as advanced mode Move optional/power-user fields behind the advanced toggle: - List Deployments: project filter, target, state - Create Deployment: project ID override, redeploy from, target - List Projects: search - Create/Update Project: framework, build/output/install commands - Env Vars: variable type - Webhooks: project IDs filter - Checks: path, details URL - Team Members: role filter - All operations: team ID scope Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * style(youtube): mark optional params as advanced mode Hide pagination, sort order, and filter fields behind the advanced toggle for a cleaner default UX across all YouTube operations. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * added advanced fields for vercel and youtube, added cloudflare and dataverse block * addded desc for dataverse * add more tools * ack comment * more * ops --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> * feat(tables): added tables (#2867) * updates * required * trashy table viewer * updates * updates * filtering ui * updates * updates * updates * one input mode * format * fix lints * improved errors * updates * updates * chages * doc strings * breaking down file * update comments with ai * updates * comments * changes * revert * updates * dedupe * updates * updates * updates * refactoring * renames & refactors * refactoring * updates * undo * update db * wand * updates * fix comments * fixes * simplify comments * u[dates * renames * better comments * validation * updates * updates * updates * fix sorting * fix appearnce * updating prompt to make it user sort * rm * updates * rename * comments * clean comments * simplicifcaiton * updates * updates * refactor * reduced type confusion * undo * rename * undo changes * undo * simplify * updates * updates * revert * updates * db updates * type fix * fix * fix error handling * updates * docs * docs * updates * rename * dedupe * revert * uncook * updates * fix * fix * fix * fix * prepare merge * readd migrations * add back missed code * migrate enrichment logic to general abstraction * address bugbot concerns * adhere to size limits for tables * remove conflicting migration * add back migrations * fix tables auth * fix permissive auth * fix lint * reran migrations * migrate to use tanstack query for all server state * update table-selector * update names * added tables to permission groups, updated subblock types --------- Co-authored-by: Vikhyath Mondreti <vikhyath@simstudio.ai> Co-authored-by: waleed <walif6@gmail.com> * fix(snapshot): changed insert to upsert when concurrent identical child workflows are running (#3259) * fix(snapshot): changed insert to upsert when concurrent identical child workflows are running * fixed ci tests failing * fix(workflows): disallow duplicate workflow names at the same folder level (#3260) * feat(tools): added redis, upstash, algolia, and revenuecat (#3261) * feat(tools): added redis, upstash, algolia, and revenuecat * ack comment * feat(models): add gemini-3.1-pro-preview and update gemini-3-pro thinking levels (#3263) * fix(audit-log): lazily resolve actor name/email when missing (#3262) * fix(blocks): move type coercions from tools.config.tool to tools.config.params (#3264) * fix(blocks): move type coercions from tools.config.tool to tools.config.params Number() coercions in tools.config.tool ran at serialization time before variable resolution, destroying dynamic references like <block.result.count> by converting them to NaN/null. Moved all coercions to tools.config.params which runs at execution time after variables are resolved. Fixed in 15 blocks: exa, arxiv, sentry, incidentio, wikipedia, ahrefs, posthog, elasticsearch, dropbox, hunter, lemlist, spotify, youtube, grafana, parallel. Also added mode: 'advanced' to optional exa fields. Closes #3258 * fix(blocks): address PR review — move remaining param mutations from tool() to params() - Moved field mappings from tool() to params() in grafana, posthog, lemlist, spotify, dropbox (same dynamic reference bug) - Fixed parallel.ts excerpts/full_content boolean logic - Fixed parallel.ts search_queries empty case (must set undefined) - Fixed elasticsearch.ts timeout not included when already ends with 's' - Restored dropbox.ts tool() switch for proper default fallback * fix(blocks): restore field renames to tool() for serialization-time validation Field renames (e.g. personalApiKey→apiKey) must be in tool() because validateRequiredFieldsBeforeExecution calls selectToolId()→tool() then checks renamed field names on params. Only type coercions (Number(), boolean) stay in params() to avoid destroying dynamic variable references. * improvement(resolver): resovled empty sentinel to not pass through unexecuted valid refs to text inputs (#3266) * fix(blocks): add required constraint for serviceDeskId in JSM block (#3268) * fix(blocks): add required constraint for serviceDeskId in JSM block * fix(blocks): rename custom field values to request field values in JSM create request * fix(trigger): add isolated-vm support to trigger.dev container builds (#3269) Scheduled workflow executions running in trigger.dev containers were failing to spawn isolated-vm workers because the native module wasn't available in the container. This caused loop condition evaluation to silently fail and exit after one iteration. - Add isolated-vm to build.external and additionalPackages in trigger config - Include isolated-vm-worker.cjs via additionalFiles for child process spawning - Add fallback path resolution for worker file in trigger.dev environment * fix(tables): hide tables from sidebar and block registry (#3270) * fix(tables): hide tables from sidebar and block registry * fix(trigger): add isolated-vm support to trigger.dev container builds (#3269) Scheduled workflow executions running in trigger.dev containers were failing to spawn isolated-vm workers because the native module wasn't available in the container. This caused loop condition evaluation to silently fail and exit after one iteration. - Add isolated-vm to build.external and additionalPackages in trigger config - Include isolated-vm-worker.cjs via additionalFiles for child process spawning - Add fallback path resolution for worker file in trigger.dev environment * lint * fix(trigger): update node version to align with main app (#3272) * fix(build): fix corrupted sticky disk cache on blacksmith (#3273) --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Lakee Sivaraya <71339072+lakeesiv@users.noreply.github.com> Co-authored-by: Vikhyath Mondreti <vikhyath@simstudio.ai> Co-authored-by: Vikhyath Mondreti <vikhyathvikku@gmail.com>

…t support

… fixes, removed retired models, hex integration

…mprovements

…ogle tasks and bigquery integrations, workflow lock

…umentation

…gespeed insights, pagerduty

…, brandfetch, google meet

… pagination, memory improvements

… selectors for 14 blocks

…ory instrumentation

…aders, webhook trigger configs (#3530)

…anvas navigation updates

…blog post

…g fixes

…ol fixes

…enaming

vercel · 2026-03-24T00:44:56Z

The latest updates on your projects. Learn more about Vercel for GitHub.

1 Skipped Deployment

Project	Deployment	Actions	Updated (UTC)
docs	Skipped		Mar 24, 2026 1:06am

greptile-apps · 2026-03-24T00:51:45Z

Greptile Summary

This PR fixes a tool-call loop in the Mothership/Copilot orchestrator by replacing Redis polling with an in-process pub-sub event system and adding a claim-based delivery mechanism to prevent async tool results from being processed more than once.

Key changes:

Tool call loop fix: Replaces the old waitForToolDecision / waitForToolCompletion Redis polling loops with waitForToolConfirmation (pub-sub, DB as source of truth). Removes the now-unnecessary promptForToolApproval option and its entire code path from handlers.ts and the OrchestratorOptions type.
Claim-based async delivery: The orchestrator now atomically claims completed async tool rows (WHERE claimedBy IS NULL) before resuming the Go stream, marks them delivered on success, and releases the claim on failure — preventing the double-resume that caused the original loop.
New delivered enum value: Added to copilot_async_tool_status with a safe multi-step Postgres migration.
Abort signal propagation: cancelRunToolExecution creates and aborts an AbortController whose signal is passed into executeWorkflowWithFullLogging, giving proper cancellation of in-flight workflow runs.
buildToolCallSummaries fix: Pending/executing tools without a result no longer collapse to success, fixing incorrect status reporting for subagent tools.
PPTX write validation (workspace-file.ts): Before storing LLM-generated PptxGenJS code, the server now runs it to verify it produces a valid PPTX — but the validation uses an unsandboxed new Function() in the main process instead of the existing sandboxed subprocess in @/lib/execution/pptx-vm. This is a security regression that needs to be addressed before merge.
Several new focused unit tests cover the async continuation path, claim semantics, pub-sub wakeup, stream replay cancellation, and abort signal passthrough.

Confidence Score: 2/5

Not safe to merge as-is due to a server-side code injection vulnerability introduced in the PPTX validation path.
The core orchestration fix (claim-based async delivery, pub-sub confirmation, abort propagation) is well-designed and thoroughly tested. However, workspace-file.ts introduces an unsandboxed new Function() call to validate LLM-generated PPTX code in the main server process, bypassing the existing sandboxed subprocess executor in pptx-vm.ts. This is a direct server-side code execution vulnerability. The fix is a one-line import change, but it must happen before merge.
apps/sim/lib/copilot/tools/server/files/workspace-file.ts — the generatePptxFromCode function must be replaced with the import from @/lib/execution/pptx-vm.

Important Files Changed

Filename	Overview
apps/sim/lib/copilot/tools/server/files/workspace-file.ts	Adds server-side PPTX validation via a locally-defined `new Function()` executor, bypassing the sandboxed subprocess in `pptx-vm.ts`. Critical security regression.
apps/sim/lib/copilot/orchestrator/index.ts	Core fix: replaces sequential async continuation with a claim-based loop that prevents double-delivery of async tool results. Adds `markAsyncToolDelivered` and `releaseCompletedAsyncToolClaim` for correct lifecycle management. The `return` removal from the `done` event handler is intentional and tested.
apps/sim/lib/copilot/orchestrator/persistence.ts	Replaces Redis polling with an in-process pub-sub channel for tool confirmations. New `waitForToolConfirmation` uses a double-check pattern (before and after subscribe) to correctly handle the race window. Well-tested.
apps/sim/lib/copilot/orchestrator/sse/handlers/handlers.ts	Removes `promptForToolApproval` path entirely and replaces fire-and-forget upsert with a properly awaited upsert (with error logging) before tool execution. Workflow run tools are now correctly forced to the client path.
apps/sim/lib/copilot/async-runs/repository.ts	Adds `getAsyncToolCall`, `getRunSegment`, `markAsyncToolDelivered`, `releaseCompletedAsyncToolClaim`. `claimCompletedAsyncToolCall` uses an atomic `WHERE claimedBy IS NULL` predicate to prevent double-claiming. `runId` is NOT NULL in the DB schema so the null-pass concern in `getRunSegment` is safe.
apps/sim/app/api/copilot/confirm/route.ts	Replaces Redis-based confirmation with durable DB upsert + pub-sub wakeup. Adds ownership check (`run.userId !== authenticatedUserId`) to prevent cross-user confirmation. `existing.runId` is always non-null per the DB `NOT NULL` constraint.
apps/sim/lib/copilot/async-runs/lifecycle.ts	New module centralising async tool status constants and lifecycle predicates. Clean, well-tested helpers for `isTerminalAsyncStatus`, `isDeliveredAsyncStatus`, and `inferDeliveredAsyncSuccess`.
apps/sim/lib/copilot/client-sse/run-tool-execution.ts	Adds `AbortController` per workflow execution and exposes `cancelRunToolExecution`. `markRunToolManuallyStopped` now returns the `toolCallId` so callers can pass an explicit override to `reportManualRunToolStop`, fixing a potential race between map deletion and the report call.
apps/sim/lib/copilot/orchestrator/stream/core.ts	Fixes `buildToolCallSummaries` to no longer collapse `pending`/`executing` tools to `success` when no result exists. Tools without a result now keep their actual status, preventing false success reporting.
packages/db/migrations/0180_amused_marvel_boy.sql	Adds `delivered` to the `copilot_async_tool_status` enum. The migration correctly converts the column to text, drops and recreates the enum, then restores the typed column — safe migration pattern for enum extension in Postgres.

Sequence Diagram

sequenceDiagram
    participant Client as Browser Client
    participant ConfirmAPI as /api/copilot/confirm
    participant DB as Postgres DB
    participant PubSub as In-Process PubSub
    participant Orchestrator as Orchestrator (index.ts)
    participant GoAPI as Go /api/tools/resume

    Client->>ConfirmAPI: POST {toolCallId, status}
    ConfirmAPI->>DB: getAsyncToolCall(toolCallId)
    DB-->>ConfirmAPI: existing row (runId NOT NULL)
    ConfirmAPI->>DB: getRunSegment(existing.runId)
    DB-->>ConfirmAPI: run row (userId check)
    ConfirmAPI->>DB: completeAsyncToolCall / upsertAsyncToolCall
    ConfirmAPI->>PubSub: publishToolConfirmation(event)
    ConfirmAPI-->>Client: 200 OK

    Note over Orchestrator: waitForToolCompletion() is parked
    PubSub-->>Orchestrator: event arrives
    Orchestrator->>DB: getToolConfirmation(toolCallId)
    DB-->>Orchestrator: terminal status → settle

    Orchestrator->>DB: claimCompletedAsyncToolCall(toolCallId, workerId)<br/>(WHERE claimedBy IS NULL)
    DB-->>Orchestrator: claimed row
    Orchestrator->>DB: getAsyncToolCalls([toolCallId])
    DB-->>Orchestrator: durable result

    Orchestrator->>GoAPI: POST /api/tools/resume {checkpointId, results}
    GoAPI-->>Orchestrator: stream response

    Orchestrator->>DB: markAsyncToolDelivered(toolCallId)
    Note over Orchestrator: claimedToolCallIds cleared

_{Reviews (1): Last reviewed commit: "Add delegating state to subagents" | Re-trigger Greptile}

apps/sim/lib/copilot/tools/server/files/workspace-file.ts

apps/sim/app/api/copilot/chat/route.ts

waleedlatif1 and others added 30 commits February 16, 2026 00:36

v0.5.91: docs i18n, turborepo upgrade

b7e377e

v0.5.92: shortlinks, copilot scrolling stickiness, pagination

da46a38

v0.5.93: NextJS config changes, MCP and Blocks whitelisting, copilot …

fdca736

…keyboard shortcuts, audit logs

v0.5.94: vercel integration, folder insertion, migrated tracking redi…

15ace5e

…rects to rewrites

v0.5.96: sim oauth provider, slack ephemeral message tool and blockki…

34d92fa

…t support

v0.5.97: oidc discovery for copilot mcp

115f04e

v0.5.98: change detection improvements, rate limit and code execution…

0d86ea0

… fixes, removed retired models, hex integration

v0.5.99: local dev improvements, live workflow logs in terminal

af59234

v0.5.100: multiple credentials, 40% speedup, gong, attio, audit log i…

67f8a68

…mprovements

v0.5.101: circular dependency mitigation, confluence enhancements, go…

4fd0989

…ogle tasks and bigquery integrations, workflow lock

v0.5.102: new integrations, new tools, ci speedups, memory leak instr…

0d2e6ff

…umentation

v0.5.103: memory util instrumentation, API docs, amplitude, google pa…

e07e3c3

…gespeed insights, pagerduty

v0.5.104: memory improvements, nested subflows, careers page redirect…

f1ec5fe

…, brandfetch, google meet

v0.5.105: slack remove reaction, nested subflow locks fix, servicenow…

70c36cb

… pagination, memory improvements

v0.5.106: condition block and legacy kbs fixes, GPT 5.4

3ce9475

v0.5.107: new reddit, slack tools

6586c5c

v0.5.108: workflow input params in agent tools, bun upgrade, dropdown…

8c0a2e0

… selectors for 14 blocks

v0.5.109: obsidian and evernote integrations, slack fixes, remove mem…

ecd3536

…ory instrumentation

v0.5.110: webhook execution speedups, SSRF patches

1c2c2c6

v0.5.111: non-polling webhook execs off trigger.dev, gmail subject he…

36612ae

…aders, webhook trigger configs (#3530)

v0.5.112: trace spans improvements, fathom integration, jira fixes, c…

e9bdc57

…anvas navigation updates

v0.5.113: jira, ashby, google ads, grain updates

4c12914

v0.6: mothership, tables, connectors

84d6fdc

v0.6.1: added better auth admin plugin

4f3bc37

v0.6.2: mothership stability, chat iframe embedding, KB upserts, new …

4bd0731

…blog post

v0.6.3: hubspot integration, kb block improvements

30f2d1a

v0.6.4: subflows, docusign, ashby new tools, box, workday, billing bu…

ff7b5b5

…g fixes

v0.6.5: email validation, integrations page, mothership and custom to…

9fcd02f

…ol fixes

v0.6.6: landing improvements, styling consistency, mothership table r…

1731a4d

…enaming

Sg312 and others added 18 commits March 22, 2026 18:07

Reenable subagent stream

6549a50

Subagent stream

98f4dfd

Fix edit workflow hydration

1e7a987

Throw func execute error on error

0b3000a

Rewrite

e2d5d27

Remove promptForToolApproval flag, fix workflow terminal logs

8934206

Fixes

24fbf41

Rebase

a573ec8

Fix buffer

7d59763

Fix

035c614

Fix claimed by

7dce59e

Cleanup v1

f77c8b4

Tool call loop

1c0697a

Fixes

df7e635

Fixes

39ff907

Fix subaget aborts

7f83a0c

Fix diff

772929c

Add delegating state to subagents

5b7a155

greptile-apps bot reviewed Mar 24, 2026

View reviewed changes

apps/sim/lib/copilot/tools/server/files/workspace-file.ts Outdated Show resolved Hide resolved

apps/sim/app/api/copilot/chat/route.ts Show resolved Hide resolved

Fix build

a943010

vercel bot temporarily deployed to Preview March 24, 2026 00:59 Inactive

Fix sandbox

63011d9

vercel bot temporarily deployed to Preview March 24, 2026 01:04 Inactive

Fix lint

405f57c

vercel bot temporarily deployed to Preview March 24, 2026 01:06 Inactive

Sg312 merged commit 775daed into staging Mar 24, 2026
11 checks passed

waleedlatif1 deleted the improvement/tool-call-loop branch March 24, 2026 02:28

icecrasher321 mentioned this pull request Mar 24, 2026

v0.6.8: mothership tool loop #3733

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(mothership): tool call loop#3729

fix(mothership): tool call loop#3729
Sg312 merged 63 commits intostagingfrom
improvement/tool-call-loop

Sg312 commented Mar 24, 2026 •

edited

Loading

Uh oh!

vercel bot commented Mar 24, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Mar 24, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Sg312 commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Type of Change

Testing

Checklist

Uh oh!

vercel bot commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greptile-apps bot commented Mar 24, 2026

Greptile Summary

Confidence Score: 2/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Sg312 commented Mar 24, 2026 •

edited

Loading

vercel bot commented Mar 24, 2026 •

edited

Loading